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@ Data processing system. 

@ The system employs a fixed point arithmetic unit (ALU) • 
and a separate floating point arithmetic unit (FPU) controlled 
by a common microinstruction store. Most fields of the 80-bit 
microinstructions control both the ALU and the FPU, includ- 
ing a source field ALUS an operation field ALUOP. A further 
2-hlt field EXT determines whether the FPU performs double 
or single precision arithmetic in which the operation defined 
by ALUOP is carried out respectively by all of 16 4-tHt slice 
units and by the top 6 slice units only for 66 and 24 bit results, 
with 8 guard biu in either case. As well as a destination field 
ALUD for the ALU there is a separate destination field FPUO 
^ for the FPU and each such field is set to disable the output of 
^ its respective unit when the output is required from the other 
unit Carry-in bits for bits 55 and 23 are entered at bit 63 and 
1^ propagated through the lower bit slice unha to avoid 
^ multtptexsng In to the higher bit slice units. 

iO 
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DATO PROCESSING SYSTEM 

Ttlis invention relates to data processing systems and Is 
concerned more particularly witt the means used to perform arithmetic 
computations • 

Data processing systems utilize arithmetic mits for 
5 performing fixed or floating point calculations. one such system, 
for example, is utilized in a 32 -bit processing system made and 
sold by Data General Corporation of Westboro, Massachusetts under the 
Registered Trade Mark ECLIPSE MV/8000. Such system utilizes an 
arithmetic logic unit (ALU) which can perform both fixed point 

lO calcialations and floating point calculaticns. 

In performing floating point calculations, the 'arithmetic 
logic unit, of such system can be operated to produce either "single ' 
precision" accuracy wherein the floating point mantissa result 
comprises 24 bits (the remaining 8 bits being designated as "guard 

15 bits" for use in rounding operations) or "double precision" accuracy 
wherein the floating point mantissa result comprises 24 bits {the 
remaining 8 bits being designated as "guard bits" for use in 
rounding operations) or "double precisicMi" accuracy in which the 
calculation is effectively a 64-bit operation wherein the floating 

2o point mantissa result ccoiprises 56 bits (the remaining 8 bits being 
used for roimding in the same manner as in single precision 
operations) • In the present ECLIPSE MV/8OO0 system, double 
precision operations are performed by the arithmetic logic unit as 
two sequential 32-bit operations so that the overall time required 

25 for double precision is substantially greater than that required for 
single precision operation. 

It is desirable to be able to perform the doiible precision 
operation in substantially less time %n.thout greatly increasing the 
overall complexity or cost of the system in this regard. 

30 An appropriate technique for doing so which has been 

suggested involves the performance of floating point operations by a 
floating point arithmetic uni.t which is separate from the arithmetic 
xmit which performs fixed point operations. Ihe floating point unit 
can be capable of performing its operation on 32-bit or 64-bit words 

35, to achieve both single and double precision operations. Thus, such a 
system would utilize both, an arithmetic logic unit (ALU) and a 
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floating point unit (FPU) , each having its C3wn control store for 
storing its own microinstructions (i«e., microcode) which are used 
in performing the separate types of arithmetic calculations involved. 
Accordingly, the ALU control store would provide microcode for 
5 performing 32-bit fixed point operations while the FPU control store 
would provide microcode for performing both 32-bit and 64-bit 
floating point operations. Ihe time required to perform a complete 
double precision floating point operation would be reduced over that 
required to perform such double precision floating point operations 

lO using only a single arithmetic logic mit which can operate only on 
32-bit words at a time. Such an approach, however, requires the 
provision of an additional different control store with entirely 
different microcode for controlling the operation of the FPU 
independently and separately from the control store and microcode 

15 operation of the ALU. 

Alternatively, it heis been suggested that a single control 
store be used, which store, however, is considerably enlarged in 
comparison with that previously required when using a single 
arithmetic logic unit so as to provide completely different and 

20 separate sets of control fields for controlling the operation of the 
ALU and for controlling the operation of the FPU. The use of such 
an enlarged control store is required in order to accommodate a 
sufficient nimiber of control fields to assure that such xanits are 
independently and effectively separately controlled. 

25 It is desirable, however, to be able to provide the above 

desired operation without the need for two separate control stores 
or for stsch an enlarged control store. With a view to meeting 
this need, the present invention provides a data processing system 
as defined in claim 1 below. 

30 Thus, in practising the invention , fixed point operations, 

as well as single and doiible precision floating point operations, 
can be aAieved utilizing an ALU board and a separate FPU board, 
both of which are controlled through the use of a single, common 
control store having common control fields. In effect, then, 

35 control of both boards is achieved using substantially the same 
overall microcode, except for the microcode which relates to 
floating point operations. Ihe general configuration of the FPU 
board is substantially similar to the general configuration of the 
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ALU board and both boards share conuoon microinstruction fields 
for the control of their operations, the FPU board requiring the use 
of only two additional fields for controlling destination operations 
and for controlling single and do\3ble precision floating point 
5 operations. Such additional 3-bit and 2-bit fields, respectfully, 
can be readily accommodated in the single previously used control 
store without enlargement thereof. Ihe remaining microcode fields 
are common to both AUJ and FPU boards, the FPU board utilizing such 
microcode when a floating point operation is required (the ALU board 

lO being effectively disabled) and the ALU board using such microcode 
when non-floating point operation is required (the FPU board being 
effectively disabled) « 

A dovible precisian operation can be performed at a speed 
which is substantially greater than that required when using the 

15 conventional 32-bit single ALU board operation mentiraed above. 

Further, since si±>stantially the same microcode is used for both 
ALU and FPU operations, it is not necessary to rewrite, or to add 
to, the previoiasly used microcode for non-floating point operations 
except for minimal control field additions needed to distinguish 

20 between ALU or FPU operations, such operations otherwise using the 
same control fields as used for the previous ALU operation. 
Because the ALU board and the FPU board share the same fields, 
except for the added destination and single/double precision fields 
needed for the FPU, the ccmplexity and costs of the system are 

25 reduced over systems which require two complete separate control * 

stores or two conpletely separate sets of control fields of a single 
control store. 

Moreover, the system is readily adapted to iii5>lementing an 
improved technique for providing floating point arithmetic roimding 
30 operations in either single or double precision modes, and the 

invention coznprises also a data processing system as defined in claim 
11. 

The invention will be described in more detail, by way of 
example, with reference to th*e accompanying drawings, in which: 

Figs. 1 and 2 make up a block diagram of an arithmetic logic 
xanit as used in the prior ECLIPSE MV/80CX) unit; 

Fig. 3 and 4 make up a block diagram of a floating point 
\init in accordance with the invention; and 
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Figs. 5-36 depict specific logic diagrams for implementing 
the block diagram of Fig. 2 of the floating point unit of the 
invention. 

At the end of this description there appear two tables, 

namely: 

TABLE 1 shows a chart defining the control store mode (CSM) 
and extension (EXT) fields as used in floating point microinstructions 
in accordeuice with the invention; and 

TABI£ 2 shows a chart defining the comparable control store 
mode (CSM) fields of the previously used HV/8000 system. 

When \2sing a floating point unit for providing either single 
or double precision operation and calculating the mantissa value, the 
64-bit word which resiilts comprises the following x>ori:ions: 
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Bits 

During a single precision operation only 32 bits (bits 0^31) of the 
64-bit words are used (bits 32-63 are "zero) . The floating point 
man^ssa resiilt coinprises bits 0-23 while bits 24-31 are designated 
as ** guard** bits ii^ilch are utilized in a rounding operation, as 
discussed in more detail below. When providing double precision 
operation the mantissa floating point result comprises bits 0-55, 
with bits 56-63 operating as the "guard** bits thereof. 

In performing both fixed and floating point arithmetic 
operations in the aforesaid ECUPSE MV/8€X» system, a single arith- 
metic logic unit (ALU) as shown in broad block diagram form in 
Figs. 1 and 2 is used. The •operation thereof is known in the art 
with respect to the currently available HV/80CX) system. In 
accordance with the invention, a separate floating point unit (FPU) 
is used for performing floating point calculations, such unit having 
the broad block diagram configuration shown in Figs. 3 and 4. As 
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can be seen therein, the overall operational block diagram of the 
FPU is substantially similar to that of the previously uUlized ALX}. 

Control of the operation of the ALO in the previoiis MV/8000 
system is determined in accordance with specifically devised micro- 
code, a typical 80-bit microinstruction format as used therein, for 
exaisple, being described below: 
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In this format bits 74-78 were unused in the MV/800O system and the 
microcode field designated as the CSM (control store mode) controlled 

15 operations of the ALU for both fixed and floating poin-t mode. In 
accordance with the invention, control of both the separate ALU and 
the FPU boards is achieved through the use of essentially the same 
microcode control store and microinstruction format wherein the five 
previously untosed bits 74-78 are used to provide two additional 

20 fields for controlling the operation of the floating point unit 

destination registers (the FPUD field bits 74-76) and for controlling 
operation for either single or dotsble precisian results (the EXT 
field bits 77 and 78), as described in more detail below. 

Thus, in the invention the microinstruction fields which 

25 control the operation of the floating point unit aure the control 
store mode (CSM) field bits 28-31, the 3-hit bloating point unit 
destination FPUD) field bits 74-76, and the 2-bit extension (EXT) 
field, bits 77 and 78. As in the previous ECLIPSE MV/8000 unit, 
the operations for both the arithmetic logic unit and the floating 

30 point unit are cycled twice for each microinstruction (a first half- 
cycle and a second half-cycle) and the CSM, FPUD and EXT fields 
define the sources of control during each of the half-cycles. 

A srmmiary of the control store mode field and the extension 
field operations is shown in TABLE 1 for the various operations 

35 which are performed. As in the MV/8000 the floating point operation 
is performed in 4-bit slices, the overall 64-bit data words utilized 
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ocamprLslng sixteen 4-bit slices, the single precision floating point 
result conprising six 4-bit slices (the "upper" six slices) and the 
dotible precision 56-bit result comprising fourteen 4-bit slices 
(the "upper'' six slices and the "middle** eight slices) and the 
5 guard bits' for use in the. doioble precision mode comprising two 4-bit 
slices (the "lower" two slices} « In the first twelve mathematical 
operations performed, such operations can be either single or double 
precision operaUons as determined by the extension (EXT) field bits. 
In the control store mode and extension fields described in 

lO raBLE 1, the FMATH, TFi:(P, FGEN, FATO, BOOT, QDEC, QINC and QADD 
modes use both half-cycles 1 and 2 for their normal operations 
(e.g., A OP B B) . The remsdjiing modes perform two FPU operations. 
Most tests, and TIiCH (nibble shift input) look at only first half 
data d\iring these modes. Ihe SHUIBg SFIXP, SGEN and SATO modes 

15 perform an operation during the first half-cycle but utilize the 

data but (CBUS) during the second half-cycle. nhe NORM and PI?ESC 
modes are used to normalize and prescale the floating point operands, 
respectively, while the MPY mode inplements a double cycle 2-bit 
Booth's multiply algorithm and the DIV mode iirplements a doiable 

20 cycle non-restoring divide operation, the latter operat±ons both 
being conventional and well known in the art. 

The first twelve mathematical modes can be used for either 
double or single precision accuracy and such operation is determined 
by the EXT field as shown in TABIE 1. Thus, for a double precision 

25 operation, where all sixteen 4-bit slices are utilized, the micro- 
order from • the appropriate field of the specified microinstruction 
(ul) or the EBXXS use controls all sixteen slices. This is desig- 
nated in TABI«E 1 by the presence of ull6 or the designations DZ16 
and OR16. In the appropriate ALU and FPU fields, i.e., the ALUS 

30 (ALU source) , the ALtX>P (ALU operation field and the FPUD (FPU 

destination) field. The designation " during the first half 
cycle for the FPUD field merely indicates that no FPUD clocking 
takes placed (i.e., there is no FPU destination oepration for such 
mode during the first half-cycle) and the designation "§16" indicates 

35 that the micro-order will defer to a pre-decoded or "forced" value. 

Pt>r a single precision operation the lower ten 4-bit slices 
(bits 24-63) are zero since only bits 0-23 are used. In the 
particular operation of the floating point unit an "AND" operation 
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in which one of the operands is zero at each bit slice provides for 
a propagation of an, input carry (CRY) bit through each bit slice to 
the pulqput carry bit so that the FPU "CRYIN'' is propagated through- 
out the lower ten 4-bit slices for single precision operation. Uie 
basis for such operation of the FPU computation circuit in this 
regard is described in more detail below with respect to the 
description of rounding techniques discussed at a later point herein. 
While zeros are present in the lower ten 4-bit slices the micro-order 
from the appropriate field of the specified microinstruction (ul) 
controls the upper six 4.-bit slices. For example, in the first 
half-cycle for all of the four Fxxx modes shown in TABLE 1, single 
precision operation (EXT field 2) is designated by the legends 
"U16/DZ10" (indicating the operands of the V5>per six slices are under 
microinstruction control and the operands of the lower ten slices 
are the DBUS and zero) for the ALUS field and "ulS/ANDlO" (indicating 
the operands of the upper six slices are under microinstruction 
control' and each of the lower ten slices receive an AND operation) 
for the ALUOP field. rhe FPUD field is not used in the first 
half-cycle, while such field is under microinstruction control 
20 during the second half-cycle. 

The Control StoVe Mode for controlling operation of the ALU 
utilizes substantially the same microcode, as shown by the CSM 
characteristics depicted in TABLE 2 (which is also the control store 
mode set up to control operation in the previously designed 
25 ECLIPSE MV/8000 system). As can be seen therein,. where there is no 
separately defined single or double precision operation for the ALU, 
effectively the same CSM operations are performed as for -the FPU 
operation and no EXT field is required. Thus, for ALU operation 
control field bits 74-78 of the above microinstruction are not 
utilized (no FPUD field and no EXT field controls are required) . In 
each case, for both ALU and FPU operations^ the ALUS and the ALUOP 
field values represent exactly the same operations as shown below. 

The ALU is cycled twice for each micro-instruction. ohe CSM 
field defines the sources of control during the two half cycles. 

The Fxxx modes use both cycles for normal A + B B operation . 
The remaining modes perform 2 ALU operations. Most tests, LA, and 
TLCH (nibble shift input) look at only first half data during these 
njodes, Sxxx modes are usefiil for setting up tests or memory starts 
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with the second half used for reading memory, O-DS, or the niJbble 
shifter. BOUT is similar but allows memory write in the second half* 
' HkxB Qxxx modes allow two registers to be loaded in one cycle. Ttie 
Q register is genereJ-ly used to contain an address which will be 
5 incremented or decremented before each memory start. MPY and DIV 
implement double cycles of the multiply and divide algorithms. 
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For the ALU operations the ALUD field and for FPU operations 
the FPUD field are also shown below. 



ALUD FIELD 
Mnem Value 
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Description 



No load; Y < 0-31 > « ALU<0-31> 

Load QREG only; Y < 0-31> « ALU <0-3r> 

Load BKEG only; Y < 0-31> = ALU <0-31> 

Load BREG only; Y < 0-31> = AREG < C>-31> 

Load BHEG with ALU shifted jri^t one bit-; 

LINK register s« ALU31? Y <0-31> = ALU<0-31> 

Load BHEG with ALU shifted right one bit; 

Shift QREG right; Y< 0-31 > » ALU <0-31 > 

LINK register :« ALU 31 

Load BREG with ALU shifted left one bit; 
Y<0-31> = ALU<0-31> 

LINK gets ALU16, ALUO for FLAGO = 0,1 respect- 
ively. 

Load BREG with ALU shifted left one bit; 

Shift QREG left; Y < 0-31> = ALU < 0-31 > 
LINK gets ALU16, ALUO for FLAGO - 0,1 respect- 
ively. 



FPUD FIEU) 
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Mnem Value Description 

O No load; Y < o-63> - FPU < 0-63 > 

QREG 1 Load QREG only, Y< o-63> =FPD<0-63> 

BREG 2 Load BREG only, y< o-63> = FPU <0-63 > 

AOUT 3 Load BREG only, y< 0-63 > = ^^^^ ^ 

^™ * BREG with IPU shifted right tme bit, 

BREG<0-63> s= o, FPU < 0-62 > 
LINK := O 

RSQB 5 load BREG, QREG with FPU. QREG shifted right one 

Dit; 

BRE6<0-63> :=o, FPU< 0-62 > 
QREG<0-63> :=FPD63, QREG <0-62> * 
LINK :« O 



LSHB 



Load BREG with FPU shifted left one bit- 
BREG<0-63> := FPU <l-63 >, QREGO 
LINK := FPDO 



LSQB 7 load BREG, QREG With FPV, QREG shifted left one bit: 

BREG<0-63> := FPU <l-63> QREGO 
QREG<0-63> := QREG < 1-62 > ,CRYY 
LINK 1= FPOO 

In each case (for the ALUD field and the FPUD field) a zero 
value is a no load operation. ihus, when the FPU board is being 
utilized for a floating point operation by the system, the ALUD field 
has a zero value so that no ALU calculations can be loaded into any 
destination register, while the FPUD field assumes a value which 
identifies an appropriate destination register into which a floating 
point answer can be loaded in accordance with the FPUD field above. 
However, if data is being transferred between -the ALU and the n«U, 
the microcode can be written to allow loading of a destination 
register on each board, according to the algorithm appropriate for 
the instruction being implemented. 

In the same manner, when the ALU board is being utilized for 
a fixed point operation by the system, the FPUD field has a zero 
value so that no FPU calculations can be loaded into any destination 
register, while the ALUD fielj3 assumes a value which identifies an 
^ropriate register into which an ALU result can be loaded in 
accordance with the ALUD field above. 

Accordingly, substantially the same microcode is used for 
non-floating p6int operations. Floating point microcode is 
rewritten to control the FPU maintaining identical algorithms so that 
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results with ALU only or ALU and FPU are the saioe* This requires 
an additional 2-bit field (the EXT field) in order to provide for 
suitable single or dotible precision floating point operation and an 
additional 3-bit field (the FPUD field) in order to identify FPU 
5 destination operations. Because the same microcode is used for 
each board, only a single znicrocontrol store is needed for storing 
the microinstructions required for controlling the operation of 
both boards. For double precision operation, the FPU board 
operates on a 64-bit word and, therefore, the overall double 

lO precision floating point calculation can be performed in less time 
than was required for the previous MV/8000 operation when the ALU 
board was used to provide both floating point single and a doxible 
precision calculations but could only do so by operating on 32-bit 
words at one time. However, the results produced are identical* 

15 In addition to providing a separate board for performing 

single and doiible precision floating podLnt cozoputatlons, as discussed 
above, the floating point unit also is designed so as to provide 
"rounding" of the single or double precision floating point mantissa 
results in a unique manner as described below. As is well known, 

20 a conventional floating point mantissa calculation for addition or 
subtraction can occur in four cycles, as set forth below: 

1. A comparison of the floating point operands is made 
to determine which is larger 

2. Itie smaller operand is pre-scaled so as to align 
25 Its radix point 

3. Hie floating point operation is performed and 
normalized so as to produce an unroxmded result 

4. Ihe unrounded result is roimded and the rotanded result 
Is normalized 

30 In a floating point mantissa operation a round bit .can he 

appropriately added into the least significant bit of the floating 
point result (e.g., bit 23 of the single precision operation or bit 
55 of a double precision operation) . Itechniques for performing such 
rounding operation are discussed in our copending European Patent 

35 Application No. 82302080.5. As discussed therein the calculation 
is performed In 4-bit slices with appropriate -carry" bits for each 
slice being effectively Inserted into the next higher adjacent slice, 
as Ahown below, (CSY0 Is the carry bit for the overall result) . in 
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an act.ual circuit implementation, for example, carry look-ahead 
circuits can be used to sv^iply the carry-in bits to each sUce. 
CRy0 CRy4 CRYB CRY12 CRY16 CRY20 CRY24 



CRY 52 CRY56 .CRY.60 CRYIN 



0 



3 4 7 8 11 12 15 16 19 20 23 



52 55 56 .59 60 63 



The floating point computation can be performed by using a 
bit-slice integrated circuit which is available under the designation 
2901 made by several manufacturers (e.g., IDM 2901A-1 ic ship made by 
National Semiconductor Corporation of Santa Clara, California) . An 
"AND" operation (fcir a 4-bit slice) performed by such well-known 
circuit can be described as follows. Let the bits of the 4-bit 
slice operands be designated as R^, R^, R^, r^ and S^, S^, and S^, 
respectively. For such AND operation, the propagate (P) output is 
"low" and the generate (G) output is defined by the Boolean equation: 



G = R^S^+R^S^+R^S^+R^S, 



O O "1"1 ":2"'2 "3"3 
and the carry out {CRYOUT) output is defined by the Boolean, equation: 

CRYOUT = CRYIN + ^o^q+^iS^+R^S^+R^S^ 

The rovtnding operation to be described in accordance with the 
invention herein makes use of the above-described operation of the 
2901 IC circuit in providing a rounding technique which avoids the 
extra time normally required in conventional rounding techniques 
which effectively reqxiire a multiplexing operation when inserting the 
round bit at the proper slice (as discussed in the aforesaid co- 
pending application) . " 

Thus, the invention recognizes that in the above operation 

of the 2901 unit, if one of the operands is zero and the unit is 

performing an AND operation the carry in bit is propagated through 

the slice as the carry out bit so that 

CRYOUT » CRYIN 
P = 1 (P is low) 
G = 0 (G is hi^) 

That is, whatever is inserted as a CRYIN bit to the 4-bit 
slice effectively from the next lower adjacent 4-bit slice is 
propagated directly through the slice and is provided as the CRYOUT 
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bit thereof. In an actual circuit iispleiDentation, for exanple, 
this propagation can be accomplished by carry look-ahead circuits. 
Carry look-ahead circiiits will propagate a carry (P=l) bit, but not 
generate one (G=0) . 

Therefore, in order to add the round bit into the appropriate 
bit position, bit 23 for a single precision memtissa computation and 
bit 55 for a double precision mantissa confutation, the romd bit 
(suitably calculated as described, for example, in the aforementioned 
copending application) can be inserted at bit 63 (the least 
significant bit) as the CRYIN bit of the lower 4-bit slice. 

For a single precision mode for each of the middle eight 
4-bit slices (bits 24-55) and the lower two 4-bit slices «(bits 
56-63)., one of the operands .is set to zero and each is controlled to 
perform an AND operation so that whatever is inserted as a CRYIN bit 
at bit 63 is propagated effectively through each of such slices as a 
CRYOUT bit* 

Accordingly, if the round bit is added as the "carry in" 
bit CRYIN, such round bit is propagated effectively through the lower 
two and the middle eight 4-bit slices so as to be added as the carry 
in bit CRY24 at bit 23 of 4-bit slice 20-23 (the lowest 4-rbit slice 
of the unrounded result) so that the rounding operation is performed 
without the need to utilize an effective multiplexing operation at 
bit 23. 

In the s<UDe manner, for a double precision floating point 
mantissa computation, the lower two slices (bits 56-63) are arranged 
during the rounding cycle to have one of their operands set to zero 
and are controlled to perform AND operations. A round bit inserted 
as the CRYIN bit at bit 63 is then effectively propagated through the 
lower two slices and inserted as the carry in bit CRY56 (the roimd 
bit) at bit 55 of the lower 4-bit slice (bits 52-55) of the unrounded 
result to complete the rounding of the final 56-bit floating point 
mantissa resialt. 

In the same manner, during non-rounding cycles in a single 
precision operation, the CRYIN bit set by microcode is effectively 
propagated to bit 23 as described above, since according to "B^liE 1 
the lower ten slices have one of their operands set to zero and are 
controlled to perform AND operations. 

Specific logic which is used to inclement the floating point 
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unit board of Fig. 3 and 4 is shown in Figs, 5-36. lh\js, the 4-bit 
microprocessor slice logic is depicted in Figs. 5-12. Figs. 13-16 
show the 4-bit slice control logic and the control store mode (CSM) 
and extension (EXT) decode logic. Figs. 17-18 show various logic 
5 such as the look-ahead carry logic, floating point status register 
logic, test logic, random field logic and the central processor data 
(CPD) bus control logic. Figs. 19-20 depict logic associated with 
the operation of the multiply register (MREG) and multiply 
accumulator (MAOC) of Figs. 3 and 4. Figs. 21-23 show logic 

lO associated with the T-latch \anit and the CPD bus interface logic 

depicted in Figs. 3 and 4. Figs. 24-26 show logic associated with 
the coarse nibble rotator, the Z-latcii logic and the shift enable 
logic of Figs. 3 and 4. Figs. 27-29 show logic associated with the 
fine nibble rotator of Figs. 3 and 4 and further shift control logic. 

15 O^ie nibble shift multiplexer units and the processor memory data bus 
interface logic are shown in Figs. 30-32. D-bus enable logic, 
dispatch logic, normalization logic and the roxmd logic for 
generating the rotmd bit SET ROUND are depicted in Figs. 33-34. The 
ACSR (source register) , ACDR (destination register) , A-register and 

20 B-register logic of Figs. 3 and 4 are shown in Figs. 35-36, as well 

as various clock logic required for FPU operation. The use by those 
in the art of such specific logic as depicted in Figs. 5-36, together 
with the foregoing description, will permit one in the art to 
practise the invention and to fabricate an FPU board thereof in 

25 accordance with the invention, as set forth above. 
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TABLE Z 

CSM » EXT - Control Store Mode & Extcnslc^i Fields 



CSN 


EXT 


Half-cycle. 1 




Half-cycle 2 




Mncin 


Val 


Mnem 


Val 


7VLUS 


ALt}0P 




ALUS 


ALUOP 


FPUD 


5MATH 


o 


-J 
















SFIXP 


1 


f DEI. 


0 


UI16 


uI16 


*JSJ 

W 


DZ16 


OR16 


Ul 


SGEN 


2 


/ SGL 


2 


UI6/DZ10 


uI6/;iND10 


Tr 


DZ16 


ORC/ANDIO 


UX 


SATU 


3 


J 












PMATH 


4 


















Ff 1 aP 


D 


I DBI> 


O 


uXlo 


uIId 




uI16 


uI16 






b 




•> 


uXo/DZlO 


UI6/AND10 




UI6/DZ10 


UI6/AND10 


ul 


FATU 


7 


















A 


















QDEC 


C 


1 DEL 


0 


uI16 


Ull6 




UI16 


UX16 


uX 




n 


1 






UXD/ AMUXQ 


w 


uxo/D«xO 


UZ 6 /AN DIG 


ux 


QADO 


E 
















MPy 


8 


NOP 


0 


016 


016 




016 


016 


\lX 


MPY 


8 


-. FST 


1 • 


ei6 


016 




016 


' 016 


Ul 


DIV 


9 


NOP 


O 


UI16 


016 


ul 


uI16 


016 


ul 


NORM 


B 


DBL 


0 


uI16 


UI16 


# 


DZ16 


ORIS 


ux 


NORM 


B 


ADS 


1 


11I6/2Q10 


UI16 




DZ16 


ORIS 


ul 


NORM 


B 


SGL 


2 


ta6/D210 


UI6/AND10 




DZ16 


ORIS 


uX 


PRESC 


P 


WTR 


c 


@16 


UI16 




DZ16 


ORIS 


uX 


PRESC 


F 


NTR 


1 


@16 


UI16 




DZ16 


ORIS 


uX 



Xn the tables the following abbreviations are used: 

uX - Represents the u-order from the appropriate field of 
the specified instruction, 

ull6 - The u-order from the appropriate field of the 

specified u-instruction controls all 16 slices. 

<specif ied>16 - The specified u-order from the above table controls 
all 16 slices. 

# - No clock takes place, but the FPUD u-order still 
controls the AOOT multiplexer, i.e. <0-31> still 
gets the value centraled by ul. 

0 (016) - Ihe u-order will defer to a predecoded or "Forced" 
value (for all 16 slices) . • 

uI6/<specified>10 - The u-order from the appropriate field of the 

specified u-instruction controls the most 
significant 6 slices. The least significant lO 
slices are forced to the <specified> u-order from 
the above table. 
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TABLE 2 



Mnem 



SMATH 
SFIXP 
SGEN 
SATU 

FMATH 
FFIXP 
FGEN 
FATU 

MPY 
DIV 
BOUT 
NOKM 

QDEC 
QINC 
QADD 
PRESC 



Val 



O 
1 
2 
3 

4 
5 
6 
7 

8 
9 
A 
B 

C 
D 
£ 
F 



Half-cycle 1 
ALUS ALUOP ALUD 



ul 
ul 
ul 
Ul 

ul 
ul 
ul 
ul 

« 

ul 
ul 
ul 

2Q 
ZQ 
DQ 
@ 



ul 
ul 
ul 
ul 

ul 
ul 
ul 
ul 



ul 
ul 

SUB 
ADD 
ADD 
OR 



QREG 
QREG 
QREG 



Half-cycle 2 
ALUS ALUOP ALOD 



DZ 
DZ 
DZ 
DZ 

UZ 
ul 
ul 
Ul 

. ul 
ZB 
DZ 

ul 
ul 
ul 
DZ 



OR 
OR 
OR 
OR 

ul 
ul 
uX 
ul 

e 
e 

OR 
OR 

ul 
ul 
ul 
OR 



Ul 

ul 
ul 
ul 

ul 
ul 
ul 
ul 

ul 
ul 
ul 
ul 

uX 
ul 
ul 
ul 



DIST 
Type 



Math 
Gen 
Gen 
Gen 

Math 
Gen 
Gen 
Gen 

Math. 
Math 
Gen 
Math 

Gen 
Gen 
Gen 
Math 



CRYIN 


IVpe 


TVpeO 

TypeO 
TypeO 


Math 
Fixp 
Gen 
Atu 


TypeO 
Typel 
l^peO 
IVpeO 


Math 
Fixp 
Gen 
Atu 


Type2 
^pe3 
TypeO 
TypeO 


Math 
Math 
Gen 
Math 


* TypeO 

* TypeO 
*TypeO 

TypeO 


Gen 
Gen 
Gen 
.Math 
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CLAIMS 



1. A data processing sjystem handling both fixed point and 
floating point computation operations, comprising an arithmetic unit 
for profiding fixed point arithmetic computations, a floating point 
unit for providing floating point arithmetic computations and a 
single control store means for supplying microinstructions to control 
the operations of both the units, characterised in that both of the 
units are responsive to a plurality of ccaranonly shared control fields 
of the microinstructions during their respective operations and in 
that the floating point unit is further responsive to at least one 
additional control field diaring its operation. 

2. A system in accordance with claim 1, characterised in that 
the arithmetic computation unit is responsive to a selected value of 
a first control field for preventing the use of any fixed point 
computation results from the arithmetic unit when the floating point 
unit is supplying its floating point computation result to the data 
processing system. 

3. A system In accordance with claim 1 or 2, diaracterised in 
that the floating point unit is responsive to a selected value of a 
second control field for preventing the use of any floating point 
computation results from the floating point unit when the arithmetic 
unit is supplying its fixed point computation result to the data 
processing system. 

4. A system in accordance with claim 3, the first and second 
control fields are fields which identify respectively the destination 
of a fixed point computation result from the arithmetic unit and the 
destination of a floating point computation result from the floating 
point unit. , 

5. A system in accordance with claim 4, characterised isx that 
the selected value of the arithmetic destination control field is set 
for a no-load operation when the floating point unit is supplying its 
result to the data processing system and the selected value of the 
floating point destination control field is set for a no-load operation 
when the arithmetic unit is supplying its result to the data processing 
system* 
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6. A system in accordance with any of claims 1 to 5, 

characterised in that a common control field of the control store is 
used to identify the source of data which is to be supplied either 
to the arithmetic unit when it is computing a fixed point result or 
to the floating point unit when it is computing a floating point 
result. 



7. A system in accordance with claim 6, characterised in that 
a common control field of the control store is used to identify the 
operation %^ich is to be performed by either the arithmetic unit when 
it is performing a fixed point operation or the floating point unit 
when it is performing a floating point operation. 

8. A system according to any of claims 1 to 7, characterised 
in that the said at least one additional control field comprises a 
control field for iden tiding whether the floating point unit is 
operating on data words of a first or a second number of data bits. 

9. A system in accordance with claim 8, characterised in that 
the first number is 32 bits for providing a 24 bit floating point 
mantissa result and the second number is 64 bits for providing a 56 
bit floating point mantissa result. 



10. A system in accordance with claim 9, characterised in that, 
when the control field identifies a 32 bit operation, the field 
controls the operation of the floating point unit in a manner such 
that the operands of the coinputation used for the first 24 bits of 
the floating point words are controlled by a microinstruction from 
the control store and the remaining 40 bits of the floating point 
words are zero. 

11. A data processing system having a floating point unit for 
providing floating point comeutation operations, the floating point 
unit including processing means for providing an overall comptitation 
result comprising a first plurality of bits representing a desired 
floating point cozxqputation result and a second plurality of bits not 
used in this desired conputation result;, and means for providing a 
carry-in bit to be added to the floating point ccoputation result i 
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diaracterised by means for adding the carry-in bit to the least 
significant bit of the overall computation result and for causing 
the carry- in bit to be effectively propagated through the processing 
means so as to be added to the least significant bit of the first 
plurality of bits to form the floating point computation result. 

12. A system in accordance with claim 11, characterised in that 
the processor means conprises a plurality of multiple-bit slice 
processor units each processing selected bit slices of floating 
point operands to provide the overall coxsputation result, the overall 
result being formed of a plurality of bit slices each effectively 
provided with a carry-in bi't from an adjacent lower significant bit 
slice processor unit and effectively sxxpplying a carry-out bit to an 
adjacent higher significant bit slice processor lanit, the carry-in 
bit added to the least significant bit of the overall result being 
effectively propagated through the plurality of bit slice processors 
which form the second plurality of bits so as to be added to the bit 
slice which forms the least significant bit slice of the first 
plurality of bits, 

13. A system in accordance with claim 12, characterised in that 
the processing means has an operating characteristic stich that when 

a bit slice processor unit thereof is placed in an AND operating 
condition and at least one of the operands of the bit slice 
processor unit is set to zero, the carry-in bit is effectively 
propagated through the bit slice processor unit as the carry-out bit 
thereof. 

14. A system in accordance with claim 13, characterised in that 
each bit slice processor unit is capable of asserting a propagate bit 
or a generate bit and the means for adding the carry-in bit and for 
caiising the carry-in bit to be effectively propagated through the bit 
slice processor unit includes a plurality of look- ahead logic means 
respcnsive to the bit slice processor units whereby, when a bit slice 
processor unit asserts its propagate bit but does not assert its 
generate bit, the look-ahead logic means associated therewith propa- 
gates its carry- in bit but does not generate a carry-out bit. 
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15. * A system in accordance with claiia 13 or 14, characterised 
in that each of the bit slice processor ixnitfe whiA form the second 
plurality of bits is placed in an AND operating condition and at 
least one of the floating point operands in the bit slice processor 
units is set to zero, whereby the carry-in bit is effectively 
propagated through each said bit slice thereof, the carry- in bit 
being added to the least significant bit of the first plurality of 
bits to form the floating point computation result. 

16. A system in accordance with any of claims 12 to 15, 
characterised in that the bit slice processor units each process 
4 bits of the floating point operands. 

17. A system in accordance with any of claims 12 to 16, 
characterised in that the overall floating point computation result 
comprises 64 bits. 

18. A system in accordance with claim 17, characterised in 
that the first plurality of bits conqprises 24 bits and the second 
plurality of bits comprises 40 bits. 

19. A system in accordance with claim 17, characterised in that 
the first pliirality of bits comprises 56 bits and the second plurality 
of bits coxi^>rises 8 bits. 

20. A system in accordance with any of claims 11 to 19, 
characterised in that the floating point coxnputation result is an 
unrounded result and the means providing 'tiie carry-in bit comprises 
means for providing a round bit as the carry-in bit which is added 
to the least significant bit of the overall computation result, the 
round bit thereby being added to the unrounded, result to produce 

a rounded floating point computation result. 

21. A system in accordance with any of claims 11 to 19, 
characterised in that the means providing the carry-in bit cozoprises 
means responsive to a carry-in bit si^plied from a source external to 
the floating point computation xmit as the carry-in bit which is 
added to the least signlficcmt bit of the overall coxnputation result. 
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the externally 8\^plied carry-in bit thereby being added to the 
floating point coinputation result. 
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